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POLYPEPTIDES MIMICKING THE ACTIVITY D£ HIiMAH 

pPVTHROPOIETIN 

The present invention refers to polypeptides having 
a sequence obtained by the informational spectra method 
and the ability of stimulating the production of 
reticulocytes and red blood cells from bone marrow cells 
5 as well as hemoglobin synthesis and iron uptake. 

Human erythropoietin (EPO), an hormone playing a 
fundamental role in erythropoiesis , is a glycoprotein 
having a molecular weight of about 34-38 kd, the primary 
structure of which is shown in Figure 1. 
10 EPO is presently obtained by recombinant DNA 

techniques using eukaryotic cells which can glycosylate 
the expression product. 

The availability of shorter polypeptides, possibly 
active even in unglycosylated form mimicking the 
15 biological activity of human EPO, would be an highly 
desirable goal allowing the convenient preparation by 
synthetic procedures. 

It has now been found that polypeptides, having in 
the informational spectrum obtained by Fourier 
20 transformation according to the informational analysis 
method substantially the same frequencies of natural 
erythropoietin, have substantially the same activity of 
human erythropoietin. 

The informational analysis method (ISM), first 
25 disclosed by Veljkovic V. et al. in IEEE Trans. 3iomed. 
Eng. 32, 337 (1985 ); Cancer Biochem. Biophys. 9, 139, 
1987 ; Phys. Rev. Lett. 29, 105, (1972 ) and Phys . Lett. 
45A, 41, (1973 ) the content of which is herein 
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incorporated by reference, is based on the analysis of 
the information encoded in primary structure which is 
expressed by molecular electric oscillations propagating 
through polar environment. Based on the previously 
5 demonstrated strong correlation between electron-ion 
interaction potential (hereinafter EIIP) [Veljkovic V. , 
A theoretical approach to preselection of carcinogens 
and chemical carcinogenesis, Gordon & Breach Sci. Pub., 
New York, 1980; Politzer P. and Truhlar D. G. , Chemical 

10 applications of atomic and molecular electrostatic 
potential, Plenum Press, New York, 1981); Politzer P., 
Toxicol, Lett. 43, 227 (1988)] it has been proposed that 
information expressed by electric oscillations is 
encoded in protein primary structure by distribution of 

15 the values of EIIP of amino acids. 

According to this approach, the protein sequences 
are transformed into signals by assignment of numerical 
values to each amino acid. These values correspond to 
EIIP [Veljkovic V. and Slavic I., Phys . Rev. Lett., 29, 

20 105 (1972); Veljkovic V. Phys. Lett., 45A, 41 (1973)]. 
The signal obtained is than decomposed in periodical 
function by Fourier transformation. The result is a 
series of frequencies and their amplitudes. The obtained 
frequencies correspond to the distribution of structural 

25 motifs with defined physico-chemical characteristics 
responsible for biological function of protein. When 
comparing proteins which share the same biological or 
biochemical function, the technique allows detection of 
code/frequency pairs which are, specific for their common 

30 biological properties. The method is insensitive to the 
location of the motifs and, thus, does not require 



WO 97/49729 



PCT/EP97/03228 



3 

previous alignment of the sequence. The ISM was 
successfully applied in structure/function analysis and 
flg novo design of peptides [Cosic I. and Nesis D., 
Eur. J. Biochem., 170, 247 (1988); Skerl V., and Pavlovic 
5 M., FEBS Lett., 239, 1411 (1988); Veljkovic V. and 
Metlas R . , Cancer Biochem. Biophys., 10, 191 (1988); 
Cosic I. et al., Biochemie, 71, 333 (1989); Lalovic D. 
and Veljkovic V., Biosystems , 23, 311 (1989); Cosic I., 
Resonant recognition model of protein-protein and 
10 protein-DNA recognition, in Bioinstrumentation and 
Biosensor (edited by Weis D.L.), Marcel Dekker, Inc., 
New York (1990); Cosic I. et al . , Eur. J. Biochem. 198, 
113 (1991); Cosic I. and Kearn M.T. W. , J. Mol. 
Recognition., 4, 57 (1991); Veljkovic V. et al . , 
15 Biochem. Biophys. Res. Commun. , 189, 705 (1992); 
Krsmanovic B. et al., WO 93/17108. 

An object of the invention is provided by EPO 
muteins having an homology degree with natural 
erythropoietin lower than 60% or polypeptides having 
20 from 20 to 100, preferably from 15 to 70 amino acids, 
characterized by informational spectrum having 
substantially the same frequencies as found in the 
informational spectrum of natural erythropoietin. 

The muteins or polypeptides according to the 
25 invention may be designed so as to include appropriate 
0- or N-glycosilation sites even though it has been 
surprisingly found that glycosilation is not always 
necessary for the biological activity. 

More particularly, the muteins or polypeptides of 
30 the invention are characterized by the frequency 
component 0.312 ± 0.004 in the informational spectrum 
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and at least one of the following frequency components: 
0.023, 0.156, 0.180, 0.185, 0.258, 0.273, 0.285, 0.363 
and 0.500 determined with the accuracy of I 0.004. 

Preferred muteins of the invention are shown in 
5 Figures 2a and 2b (Sequence Id n. 1 and 2) whereas 
preferred polypeptides are shown in Figure 3 (Sequence 
Id n. 3, 4, 5 and 6). In Figures 1, 2a and 2b, the 
predicted amphipathic a helices are underlined with 
double line (====). Residues predicted to be on the 

10 surface are underlined. Each of the three N-linked and 
one of O-linked glycosylation sites is designated by an 
asterisk. The mutations are designated by small letters. 

Another group of preferred peptides according to 
the invention and having 16-40 aminoacids, is 

15 represented by the following general formula: Xl-Tyr-X2- 
Cys-X3-X4-Gly-Pro-X5-Thr-Trp-X6-Cys-X7-Pro-X8, where XI 
= Thr, Gin, His or Asp, X2 = Ser, Asn, Gly, Pro or Ala, 
X3 = Thr, Gin, Asp, Ser, Asn, Gly, Pro, Ala, lie, Val, 
Phe, Lys, Tyr, Met or Glu, X4 = Phe , Val or Met, X5 = 

20 Leu, He, Met or Val, X6 = Leu, He or Val, X7 = Lys, 
Arg or Asp, and X8 = Gin, Thr, His, Ser, Ala, Val or 
Leu. Optionally the peptide may be cyclised or 
dimerised . 

These peptides are able to bind to. the 
25 erythropoietin receptor and show in the IS the frequency 
component 0.312 and the corresponding amplitude A > 
0.11. 

The invention also refers to polynucleotide 
sequences coding for said muteins or polypeptides, to 
30 expression vectors comprising said polynucleotide 
sequences and to hosts transformed or transfected by 
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5 

said vectors, as well as to nucleotide sequences which 
hybridize to the above mentioned coding sequences. 

The polypeptides sequence of the invention are 
determined by a procedure involving the following steps: 

1. determination of the consensus characteristic 
frequencies for the EPO molecules; 

2. derivation of a new numerical sequence of the 
desired length having the same characteristic 
frequencies using inverse Fourier transformation; 

3. determination of the amino acid corresponding to 
each element of this new numerical sequence from 
values of EIIP (see Table 1): 
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Table 1 



Amino acid EIIP [Ryl 



5 



10 



15 



20 



L 


0 .0000 


I 


0.0000 


N 


0.0036 


G 


0.0050 


V 


0.0057 


E 


0.0058 


P 


0.0198 


H 


0.0242 


K 


0.0371 


A 


0.0373 


Y 


0.0516 


W 


0.0548 


Q 


0.0761 


M 


0.0823 


S 


0.0829 


C 


0.0829 


T 


0.0941 


F 


0.0946 


R 


0.0959 


D 


0.1263 



25 * Ry = Rydberg unit 

The polypeptides may be obtained "by conventional 
methods of peptide synthesis or by known recombinant DNA 
techniques . 

The polypeptides or muteins of the invention may be 
30 administered to humans or animals in form of suitable 
pharmaceutical compositions, usually but not exclusively 
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10 



15 



20 



25 



30 



7 

to be administered parenterals . Said compositions will 
contain from 1 to about 100 mg of mutein or polypeptide 
for the treatment of the same pathological conditions 
presently treated with human or recombinant EPO. 

The following examples further illustrate the 

invention. 

fiyample 1 

ft n fflyc -is of .iminn nrjfl aegasnceji * v the JSM 
in the first step of the ISM analysis each 
constitutive element (amino acid) in analyzed sequence 
is represented by corresponding EIIP value. For 
calculation of EIIP the following expression derived 
from the "general model pseudopotential" [Veljkovic V. 
and Slavic I., Phys. Rev. Lett., 21. 105 (1972); 
Veljkovic V., Phys. Lett., A5A, 41 (1973)] was used: 

W = 0.25 Z* sin (1.04 * Z*)/2 d> 
where Z* is the average quasi-valence number determined 



m 



Z*-= E »iV» (2) 
i=l 

where Z is the number of valence electrons of the i-th 
atomic component, H - the number of atoms of the i-th 
component, m and N - the number of atomic components and 
total number of atoms in the side group, respectively. 
The values of EIIP for side groups of amino acids 
calculated in accord with Eg. (1) are given in Table 1. 

The numerical serie determined in this way is 
finite-lenght deterministic discrete signal containing 
information corresponding to selective long-distance 
interaction among biological macromolecules . In order to 
analyze this information, the obtained numerical 
sequence was subjected to discrete Fourier 
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transformation (DFT), which is defined as follows 
[Rabiner R. , L . and old B., Theory and applications of 
digital processing. Prentice-Hall Inc., nglewood 1975}]: 

5 X(n) = 2 x < m > e N ' n = !> 2 N/2 (3) 

Here x(m) is the m-th of a given numerical series, and 
X(n) are coefficients of DFT. The coefficients are 
describing the amplitude, phase, and frequency of 

10 sinusoids from which original signal consists. The 
absolute value of complex DFT coefficients determines 
the amplitude spectrum which is in the ISM defined as 
informational spectrum (IS) and represented by the 
following equation : 

15 S(n) = X(n)X*(n) = jX(n}| 2 , n** 1, 2, ,N/2, (4) 

It was assumed that points in analyzed numerical 
sequences are equidistant with the distance d - 1. In 
this case the maximal frequency in the spectrum is 
F max = 1/2d = °- 5 " Jt is important to note that the 

20 frequency range is independent of number of points in 
the sequence. The total number of points in the sequence 
(i.e. number of amino acids in the analyzed primary 
structure) influences only the resolution of IS. In the 
case of an N-point sequence, the resolution equals 1/N. 

25 The minimal lenght of sequence that can be analyzed by 
ISM is determined by the desired resolution of the 
spectrum. Therefore, this number is determined by the 
expected number of peaks which are to be strictly 
separated and cannot be exactly defined. The minimal 

30 length of sequence which can be analyzed by ISM with 
suitable accuracy is 16 amino acids. 
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In this way, the information primarily defined by 
the sequence of symbols representing amino acids is 
presented in spectral form which is more suitable for 
mathematical analysis. It is important to note that ISM 
5 does not influence this information and represents only 
a tool for its analysis (like the prism which decomposes 
the white light in its spectral components). Each 
frequency in the IS represents a particular 
informational component encoded in the primary structure 
10 by regularly distributed structural motifs with similar 
electronic properties. 

Example 2 

Ep p-H c.Rt.iov of the techniqu e — i n sample 1 — t£ — th& 
analysis of spo proteins 
15 The algorithmic procedures were applied to the 

mammalian EPO protein sequences. 

The analysis procedure comprises the following 

steps : 

1. each amino acid sequences was converted to the 
20 numerical sequence by representing each amino acid 

with the corresponding value of the EI IP; 

2. this numerical sequence was converted into a 
numerical spectrum using fast Fourier transform 
(hereinafter FFT); 

25 3. spectra were mutually compared using cross-spectral 
analysis with the aim to extract common frequency 
components . 

From the analysis of cross-spectra of EPO molecules 
from various mammalian species (mouse, rat, rabbit, 
30 ship, monkey and human) one can deduce a set of 
characteristic frequencies which predominates in the 
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obtained CIS. In Table 2 all characteristics frequencies 
in the EPO CIS with S/N > 1 are given: 

Table 2 



Fl 


0.023 


F2 


0.156 


F3 


0.180 


F4 


0.195 


F5 


0.258 


F6 


0.273 


F7 


0.285 


F8 


0.312 


F9 


0.363 


F10 


0.500 



All frequency components in Table 2 are determined 
15 with accuracy of ± 0.004. 

From the obtained results, it is possible to 
conclude that information which is essential for 
biological activity of the analyzed EPO molecules is 
completely determined with the set of characteristic 
20 frequencies given in Table 2. 

Example 3 

Application of the technique in example 3 1£ thSL 

analysis of EPO-EPOR interaction 

In order to determine which of the characteristic 
25 frequency components from Table 2 determines the 
information that is essential for human EPO-EPOR 
interaction, the cross-spectrum between these two 
proteins is obtained. 

The characteristic frequencies corresponding to the 
30 first 15 amplitudes in EPO-EPOR cross-spectrum are given 
in Table 3. 
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10 



15 



20 



25 



30 







F 


S/N 


0. 311 


10. 6 


0.258 


7.5 


0.272 


7.3 


0.113 


7.0 


0.361 


7.0 


0.498 


5.2 


0.430 


5.1 


0. 158 


5.1 


0.031 


4.6 


0. 382 


4.6 


0.154 


4.4 


0. 283 


4.3 


0.275 


3.9 


0.347 


3.9 


0 . 008 


3.5 



The results presented in Table 3 show that the main 
part of information corresponding to human EPO-EPOR 
interaction is determined by the frequency component 
0.311. It is also important to note that, taking into 
account accuracy of ± 0.004 in the determination of 
frequency values, 8 of 10 characteristic frequencies 
from the EPO CIS (Table 2) are also contained within the 
first 15 characteristic' frequencies in the human EPO- 
EPOR cross-spectrum (Table 3). 

F.vanrole 4 

frpniir-ntio n nf the technique in pyamplft 1 to assign of 

KPO muteins 

Once the characteristic frequencies for EPO protein 
family had been determined, it was possible to design 
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EPO muteins introducing a large number of amino acid 
substitutions in human EPO . The main condition that must 
be satisfied in the design of these muteins is the 
conservation of IS of human EPO. 
5 In Figure 2a, the primary structure of mutein- 1 

generated by substitution of 99 (56.6%) amino acids in 
human EPO is given (Sequence Id n. 1). The IS of this 
mutein is given in Table 4 and Figure 4a. As can be 
seen, this IS contains all 10 characteristic EPO 
10 frequencies from Table 2, as well as other frequency 
components corresponding to first 15 amplitudes in IS of 
the human EPO. 

Table 4 



IS (EPO) 


IS (mut. 1) 


IS (mut. 2) 


0. 359 


0.359 


0.359 


0. 500 


0.156 


0.500 


0. 156 


0.500 


0.156 


0.195 ' 


0.285 


0.285 


0.258 


0.258 


0.258 


0.285 


0.195 


0.195 


0.203 


0.203 


0.273 


0.273 


0.273 


0.203 


0.180 


0.402 


0.180 


0.004 


0.180 


0.301 


0.445 


0 . 004 


0.004 


0.113 


0.113 


0.211 


0.023 


0.172 


0 .113 


0.312 


0.211 


0.312 


0.050 


0.312 


0.402 



30 In order to design the mutein that with higher 

probability will express EPO activity, also structural 
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characteristics that are important for this biological 
activity must be taken into account [Boissel jp. et al,, 
J. Biol. Chem., 268, 15983 (1993); Wen D., et al., J. 
Biol. Chem., 269, 22839 (1994)]. In Figure 2b (Sequence 
5 Id n. 2), the primary structure of mutein-2 generated by 
substitution of 72 (43.4%) amino acids in human EPO is 
given. This mutein besides conserved IS of human EPO 
(Table 4 and Figure 4b) also has preserved all 
structural elements that are important for the 

10 biological activity of the human EPO, including all 
glycosilation sites [Wen D., et al., J- Biol. Chem. 269, 
22839 (1994)]. The comparison of the alpha and beta 
propensity, hydrophilicity , hydrophobicity , solvent 
accessibility, antigenicity and secondary structure of 

15 the human EPO and mutein-2 are given in Figure 5-6. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 (i) APPLICANT: 

(A) NAME: DIAPHARM LIMITED 

(B) STREET: Quay House, South Esplanade 

(C) CITY: St. Peter Port 

(D) STATE: Guernsey 

10 (E) COUNTRY: Channel Islands 

(F) POSTAL CODE (ZIP): GY1 4EJ 

(i) APPLICANT: 

(A) NAME: MARKOV I C Dejan 
15 (B) STREET: Via Andrea Verga, 5 

(C) CITY: Milano 

(D) STATE: Italy 

(E) COUNTRY: IT 

(F) POSTAL CODE (ZIP): 1-20144 

20 

(ii) TITLE OF INVENTION: POLYPEPTIDES MIMICKING THE 

ACTIVITY OF HUMAN ERYTHRO- 
POIETIN 

25 (iii) NUMBER OF SEQUENCES: 6 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

30 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 
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10 



15 

#1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

15 Trp GlY GlY Arg Val Val Cys Asp Thr Arg He Val Glu Arg 
1 5 10 

Tyr Val Val Glu Trp His Glu Trp Glu Asn He Ser Ser Pro 
15 .20 25 

Cys Trp Glu Lys Cys Thr Val Asn Glu Asn Val Ser He Gly 
20 30 35 40 

Asp Ser His lie Asn Phe Tyr Trp Ala His Arg Met Glu lie 
45 50 55 

Pro Gin Gin Trp Leu Glu lie Ala Gin Pro Val Trp Val Val 
60 65 70 

25 Thr Glu Trp lie Val Arg Pro Gin Trp Val Val Val Asn Ser 

75 80 
Thr Gin Gly Ala Glu Gly Val Gin Val Lys Leu Asp His Trp 
85 90 95 

He Thr Pro Val Arg Thr Val Ser Ser Val Val Arg Trp Val 
30 100 105 HO 

Pro Trp Gin His Glu Trp Val Thr Gly Gly Asp Ala Ala Ser 

115 120 125 

Ala Ala Gly Val Arg Ser Val Ser Trp Asp Ser Phe Arg His 
130 135 I 40 
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Val Phe Arg Leu Tyr Thr Asn Phe Val Arg Gly His Val His 
145 150 

Val Tyr Ser Gly Glu Trp Cys Arg Ser Pro Asp Arg 
155 160 165 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 166 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Trp Gly Gly Arg Val Val Cys Asp Ser Arg Val Leu Glu Arg 
1 5 10 

20 Tyr Leu Leu Glu Ala His Glu Ala Glu Asn He Thr Ser Pro 
15 20 25 

Cys Trp Glu Lys Cys Thr Val Asn Glu Asn He Thr He Pro 
30 35 40 

Asp Ser His He Asn Phe Tyr Trp Ala His Arg Met Glu He 
25 45 50 55 

Pro Gin Gin Ala Leu Glu Leu Trp Gin Gly Ala Leu Leu Leu 
60 65 70 

Thr Glu Ala Leu Val Arg Pro Gin Trp Val Leu Val Asn Ser 
75 80 85 

30 Ser Gin Gly Ala Glu Gly Leu Gin Leu His Leu Asp His Ala 

90 95 

Leu Thr Pro Leu Arg Thr Leu Ser Ser Val Val Arg Trp Val 
100 105 110 
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Pro Trp Gin His Glu Trp Val Thr Gly Gly Asp Ala Ala Ser 

115 120 125 

Ala Ala Gly Val Arg Ser Leu Ser Ala Asp Ser Phe Arg Lys 
130 135 I 40 

5 Val Phe Arg Leu Tyr Thr Asn Phe Leu Arg Gly His Val His 

145 150 
Val Tyr Ser Gly Glu Trp Cys Arg Ser Gly Asp Arg 
155 160 165 

10 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 25 amino acids 
(B-) TYPE: amino acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Pro Gin Thr Leu Val Asn Ser Ser Tyr Gin Lys His Asp 
1 5 10 

Tyr His Asp Pro Leu Asp Ala His Asp His Glu 
25 15 20 25 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Phe His Lys Arg Trp Ala Ala Ser Ala Ala Gin Trp Trp Thr 

15 10 

Ala His Arg Trp Met Phe Gly Tyr Asp Trp Lys Gin His Trp 

15 20 25 

10 Asp Glu Asn lie Asp Gin lie 
30 35 



(2) INFORMATION FOR SEQ ID NO: 5: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



25 Thr Gin Glu Trp Ala Gin Met Ala Ser Ala Phe Ala Trp Arg 
15 10 

Ser His Arg Gin Ala Glu Asn lie Asp Ala His Thr Glu Gin 
15 20 25 

Thr Ala His Lys Asp Ser Lys Met Gin Leu Ser Phe Lys Leu 
30 30 35 40 

Met Gin Thr Trp His Ala Arg Gin Trp Ala Ala Ser Ala Ala 
45 50 55 
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Glu Trp Asp Gin Met Gin Leu Gin 
60 



(2) INFORMATION FOR SEQ ID NO: 6: 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Gin Tyr Tyr Thr Lys Trp Trp Thr Gin Gin Gin Ala Tyr Asp 



15 



20 



1 5 10 

Thr Tyr Cys Gin Tyr Gin His Met Thr Val Asn Ser Lys Trp 
15 ' 20 25 

Arg Gin Leu His Asp Arg His Trp Trp Pro Gin Arg Pro Trp 

30 35 40 

Tyr Trp Gin Ala His Met Cys Trp Tyr Trp Cys Gin Gin 
45 50 55 
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CLAIMS 

1. A polypeptide mimicking the activity of human 
erythropoietin, having an amino acid sequence which is 
5 different from that of human erythropoietin and presents 
substantially the same frequencies as natural 
erythropoietin in the information spectrum obtained by 
Fourier transformation according to the method of 
informational analysis. 

1° 2. A polypeptide according to claim 1, which is an 
erythropoietin mutein having an homology degree with 
natural human erythropoietin lower than 60%. 
3. A polypeptide according to claim 1 or 2 , having 
from 15 to 70 amino acids. 

15 4. A polypeptide according to any one of claims 1 to 
3, having the frequency component 0.312 ± 0,004 in the 
informational spectrum and at least one of the following 
frequency components: 0.023, 0.156, 0.180, 0.195, 0.258, 
0.273, 0.285, 0.363 and 0.500 the accuracy being ± 

20 0.004. 

5. A polypeptide according to any preceding claim, 
having any of the primary sequences of sequences Id n. 
1-6. 

6. A polypeptide according to claim 1 including the 
25 amino acid sequence: Xl-Tyr-X2-Cys-X3-X4-Gly-Pro-X5-Thr- 

Trp-X6-Cys-X7-Pro-X8, where XI = Thr, Gin, His or Asp, 
X2 = Ser, Asn, Gly, Pro or Ala, X3 = Thr, Gin, Asp, Ser, 
Asn, Gly, Pro, Ala, He, Val, Phe , Lys, Tyr , Met or Glu, 
X4 = Phe, Val or Met, X5 = Leu, He, Met or Val, X6 = 
30 Leu, lie or Val, X7 = Lys, Arg or Asp, and X8 = Gin, 
Thr, His, Ser, Ala, Val or Leu. 



WO 97/49729 



PCT/EP97/03228 



10 



21 

7. a polynucleotide coding for a polypeptides 
according to any preceding claim. 

8. An expression vector comprising a polynucleotide 
according to claim 6. 

9. A pharmaceutical composition containing a 
polypeptide according to any of claims 1 to 5 in 
admixture with one or more suitable carriers or 
excipients therefor. 

10. A peptide having human EPO activity but which is 
not natural human EPO , substantially as described 
herein. 
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